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Abstract —A closed-form expression for a lower bound on the 
per soliton capacity of the nonlinear optical fibre channel in the 
presence of (optical) amplifier spontaneous emission (ASE) noise 
is derived. This bound is based on a non-Gaussian conditionaf 
probability density function for the soliton amplitude jitter 
induced by the ASE noise and is proven to grow logarithmically 
as the signal-to-noise ratio increases. 

1. Introduction 

It is widely accepted that in order to meet the ever-growing 
demand for data rates in fibre-optic telecommunication sys¬ 
tems, the spectral efficiency of the optical fibre transmission 
system needs to be increased ijT). The key physical effects 
distinguishing a fibre optical system from a free space trans¬ 
mission are: dispersion, nonlinearity and optical noise El¬ 
ia. The implementation of the “fifth generation” of optical 
transceivers and networks operating with coherent detection, 
advanced multilevel modulation formats, and digital signal 
processing techniques, has led to the possibility of channel 
rates exceeding 100 Gbit/s Q- The key to this breakthrough 
is the mitigation of linear transmission impairments, such as 
chromatic and polarization mode dispersion. 

The performance of current coherent systems is limited by 
noise and nonlinearity. In contrast to linear channels, however, 
spectral efficiencies for the optical fibre channel usually exhibit 
a peak and decay at high input powers; this is often referred 
to as the “nonlinear Shannon limit” 0, M- This behaviour is 
caused by the Kerr nonlinearity and is believed to ultimately 
lead to a “capacity crunch” m, i.e., to the inability of the 
optical network infrastructure to cope with the increasing 
capacity demand. 

The capacity analysis of the nonlinear channel relies on 
well-established methods of information theory 0,1101. How¬ 
ever, most of the analytical results obtained to date concern 
linear channel models, and hence, are not directly applicable to 
nonlinear optical channels. Despite numerous efforts to define 
the influence of Kerr nonlinearity on the channel capacity 0, 
0, M-M, the capacity of the nonlinear optical channel 
still remains as an open research problem. Most of the capacity 
bounds presented in the literature typically display a peaky 
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behaviour, where the maximum is reached at a finite threshold 
power. To the best of our knowledge, the first nondecaying 
(lower) bound on the capacity of the nonlinear optical fibre 
channel (with zero average dispersion) was presented in HD- 
Other nondecaying bounds include, e.g., those given recently 
in Ha and lEl, llTOl . 

A multitude of different nonlinearity mitigation techniques 
have been proposed over recent years to suppress nonlinearity- 
induced distortions. This includes receiver-based digital signal 
processing ED, digital back-propagation ll22l . optical phase 
conjugation ll23l . twin-waves phase conjugation ll24l . etc. 
However, there are still many limitations and further chal¬ 
lenges in applying these methods. A promising alternative for 
nonlinearity compensation is the nonlinear Eourier transform 
(NLET) developed in the 70’s ll25l . Il2^ . The applications 
of the NLET in optical communication originates from the 
pioneering work of Hasegawa and Nyu ll27l . an approach 
that has been extended in a number of recent works EU- 
E3- Notably, an experimental demonstration of a NLET-based 
transmission was recently presented by Billow in ll^ . 

The use of NLET for nonlinearity compensation in optical 
fibre links is possible because the master model governing 
signal propagation in a single mode optical fibre (in the ab¬ 
sence of noise and loss) is the nonlinear Schrodinger equation 
(NLSE) 0-0 that belongs to the class of integrable (i.e., 
completely solvable) evolutionary equations ll25l . The solution 
method can be considered as the generalisation of the linear 
Eourier transform (ET) operation onto the nonlinear (inte¬ 
grable) system, hence the name NLET. Similarly to the ET, 
the NLET decomposes a waveform in the NLSE space-time 
domain into the nonlinear normal modes inside the nonlinear 
spectral domain 1291, llTSl . The key underlying feature of the 
NLET transmission is that these nonlinear modes (nonlinear 
signal spectrum) propagate without cross-talk, effectively in a 
linear manner. Thus, the nonlinear spectrum can be used for 
encoding and efficient transmission of information over the 
nonlinear fibre. 

The original work by Hasegawa and Nyu 1271 introduced 
the concept of “eigenvalue communications”, where the in¬ 
formation was encoded using discrete eigenvalues associated 
with the solitonic degrees of freedom emerging from the NLET 


signal decomposition 0 (see also ||2^ ). In the absence of both 
loss and noise, the evolution of nonlinear modes is inherently 
free from any nonlinear impairments, including a nonlinear 
cross-talk. The loss in optical links is usually compensated 
by using lumped or distributed amplification; in our case 
we assume the ideal distributed Raman amplification scheme 
resulting in the lossless NLSE 0, HOl . However, the 
signal will still be distorted by amplifier-induced spontaneous 
emission (ASE) as well as signal-noise beating. 

In this paper, we study the channel capacity (in bits per 
soliton symbolic for a transmission system based on optical 
solitons (sufficiently separated in time domain) launched into 
a noisy NLSE channel. The information is assumed to be 
encoded in the soliton’s amplitude only, which can be extracted 
from the imaginary part of the discrete eigenvalue emerging 
from the NLET signal decomposition. We consider a discrete¬ 
time continuous-input continuous-output channel model, based 
on the asymptotically exact non-Gaussian marginal statistics 
of the soliton amplitude in the presence of weak ASE noise 
presented in ll4Tll - ll43l . We emphasise that the capacity estima¬ 
tions for such fundamentally nonlinear channels are quite few 
and far between. Notable exceptions are the works by Yousefi 
and Kschischang Il3n and Meron et al. ||33l. While in ||3T1 
the channel statistics were assumed a priori to be Gaussian 
EU eq. (27)], in a tight lower bound on the channel 
capacity as a function of the signal to noise ratio (SNR) was 
not provided. 

The discrete-time channel model governing transmission 
systems based on optical solitons is a noncentral chi-squared 
distribution with four degrees of freedom ll42ll . ||43]| . Based on 
this model we obtain an asymptotically growing lower bound 
for the channel capacity vs. SNR. This bound is similar to the 
one in Oil, where a noisy nonlinear optical fibre channel with 
zero fibre dispersion was considered. The results in this paper 
show that the reachable capacity limits for existing optical 
fibre channels could have been previously underestimated. 


II. The Master Equation and the Noncentral 
Chi-squared Channel Model 


A. Waveform channel 

We consider propagation of a slowly varying envelope 
signal formed by a sequence of solitons transmitted every 
Ts [s] over a nonlinear optical fibre. Our model combines 
the effects of chromatic dispersion (we consider the case of 
anomalous dispersion), instantaneous Kerr nonlinearity, and 
ASE noise due to optical Raman amplification. The fibre loss 
is assumed to be continuously compensated along the fibre by 
means of ideal Raman amplification and hence is set to zero 

0, ED, Eo). 

The noise-perturbed NLSE in dimensionless units is given 
by 0 , 0 , m, 


■ dq{z,t) 
* dz 


2 


\q{z,t)\‘^q{z,t) = n{z,t), ( 1 ) 


^The practically more relevant problem of channel capacity in [bit/s/Hz] is 
left for future investigation. 


(a) Discrete-time channel 



(b) Discrete-time channel 



Fig. 1. System model: (a) Transmitter, continuous-channel model governed 
by (D, and receiver, (b) Equivalent discrete-time channel model. 

where t is the time normalised by Tg, z is the distance along 
the fibre normalised by Lg = -j^ (not to be confused with 
the dispersion length), and ^2 < 0 is the group velocity 
dispersion coefficient. We also define s{t) — q{0,t) and 
r{t) = q{L,t) as the input and output waveforms of the 
physical channel after transmission distance L, respectively, 
normalised by the nonlinear power scale where 7 is 

the nonlinearity coefficient. The relationship between s(t) and 
r{t) is schematically shown in the inner part of Eig. [Ha). 

The noise term n{t, z) on the right-hand side of ([T]i is 
assumed to be a zero-mean (E [n{z, t)] = 0) circularly- 
symmetric additive white Gaussian noise (AWGN) process 
with autocorrelation function 0 eq. (53)] 

E [n{z, t) n*{z', t')] = 2D d{z — z') 6(t — t'), (2) 

where * denotes complex conjugation, E [•] is the mathematical 
expectation operator, and (5(-) is the Dirac’s delta function. 
Here the noise intensity D is written in dimensionless units E) 
eq. (5.29)] as D = 'yL‘laQ/2Ts. Eor ideal distributed Raman 
amplification, the power spectral density of the ASE noise 
is defined as 0 eq. (56)] ctq = aKr ■ hvopt, where a is 
the fibre attenuation coefficient, /ir'opt is the average photon 
energy, and Kt ~ 1 is the coefficient that characterizes the 
Raman pump providing the gain 0. 

It is known that the noiseless NLSE (i.e., ([T]i with n{z, f) = 
0 ) possesses a special class of solutions, the so-called funda¬ 
mental bright solitons 0 - 0 . At z = 0 we write it as 0 
eq. (1.40)] (in normalised units) 

g(0,f) = Aosech(Aof), (3) 

where Aq denotes the normalised soliton amplitude and we 
assume that the initial soliton frequency, phase and centre- 
of-mass position are set to zero. The unperturbed soliton 
solution © at a distance z = L is given by q{L, f) = 
Aq sech(Aof) eyip{iApL/2). 


















































B. Discrete-time channel 

The (normalised) continuous-time input signal s{t) is de¬ 
fined as 

OO 

s{t) = ^Sk{t), (4) 

k^l 

where 

Sk{t) = Aok sech[Aofe(t - k)], (5) 

and k is the discrete-time index. At each discrete time k, 
the transmitter maps an amplitude Agfe to Sk{t) via Q. For 
simplicity of the following analysis, however, we consider the 
square root of the amplitudes, i.e., Xk = 

The dimensionless energy of the feth soliton waveform is 
defined as 

(fc+l/2) 

EiAok) = j |sfc(f)pdf. (6) 

-(fe-l/2) 


We consider the regime where the inter-soliton separation 
is much larger than the typical soliton width (low duty cycle), 
so the integral in (|6]l can be taken over (— 00 , 00 ). This yields 
the well-known linear energy-amplitude scaling of the soliton 
pulse E{AQk) = 2Aofc. The minimum inter-soliton separation 
is then determined by the peak power A^f. of each individual 
soliton, which is in turn inversely proportional to the square 
of its width Tok = 

The receiver in Fig. 1(b) processes the received waveform 
r{t) during a window of length one via the forward NLFT 
and returns the amplitude of the received soliton. We assume 
ideal NLFT-detection, i.e., the sampling rate is high enough 
to ignore NLFT finite accuracy issues arising from a partic¬ 
ular algorithmic realisation GqI compared to noise-induced 
distortions. The inter-soliton separation is also assumed to be 
large enough so that there is no interaction between adjacent 
solitons, i.e., exp(—<C 1, or equivalently, 1 ^ Tok- An¬ 
other source of corruption for the soliton-based transmission 
system emanates from the Gordon-Haus (GH) timing jitter 151 , 
which defines the standard deviation of the soliton 

position as a function of the propagation distance and soliton 
amplitude. To avoid interaction between adjacent solitons, the 
GH timing jitter should also be taken into account 0. For 
a given propagation distance L, the inter-soliton separation 
must fulfill 1 > Tok -f AT^^. This condition guarantees 
that solitons behave as isolated pulses, and thus, there is no 
intersymbol interference. We assume that this condition is 
satisfied throughout this paper, and thus, from now on we 
drop the time index k. 

The exact conditional PDF for a single received amplitude 
A given a transmitted amplitude Aq is written as ll42l eq. (24)] 
(see also ill) 


PA\AoHao) 



/V erg, J 


(7) 


where erg, = LLj^D/2 is the normalised variance of the 
accumulated ASE noise and Ii{x) is the modified Bessel 
function of the first kind. Expression (|7]i is in fact the same 


PDE obtained assuming an energy-detection receiver (i.e., a 
receiver based on (|6ll), as shown in a eq. (5.501)]. 

Equation (|7]) is nothing else but a special case of a non¬ 
central chi-squared distribution with four degrees of freedom 
providing non-Gaussian statistics for soliton amplitudes. Eor 
future use, it is convenient to designate the output of the 
discrete-time channel model Y as the square root of the output 
soliton amplitudes A. By making a change of variables, the 
PDE (|7]i can be rewritten as 


2 ( 

PY\xKy\x) = — — exp I - 
(Tn a; \ 


y 


J ( 2 a;y 

V o-gi 


( 8 ) 


The conditional PDE in ([8]l describes a channel with the 
input-output relation 



where Ni, i = 1,2 ,3,4 are four independent and identi¬ 
cally distributed zero-mean Gaussian random variables with 
variance erg = ag,. The input-output relationship in (|9) is 
schematically shown in Eig. 1(b). 


III. Main results 

Since the soliton pulses are assumed to be well separated 
and the intersymbol interference due to pulse interaction can 
be neglected, the model (|8]l describes a scalar memoryless 
channel. The channel capacity is then defined as 13, M 

C = max IxY, (10) 

Px 

where Ixy is the mutual information (MI) and the optimiza¬ 
tion is performed over all possible input distributions px with 
fixed average symbol energy E [i?(Ao)]. The MI Ixy can be 
decomposed as 13, 03 


Ixy = hy — hy^x, 


( 11 ) 


where hy and hy\x are the output and conditional differential 
entropies, respectively. 

The SNR is defined as E eq. (29)] 


SNR^ 


E [E{Ao)] 
crgfTs 


2 KCTg 


( 12 ) 


where Cg is the average amplitude Cg = E [Aq] = E and 
K is the ratio between the available bandwidth and the symbol 
rate I/Tg. Thus, for a fixed bandwidth and symbol rate, the 
SNR is proportional to the parameter p = erg/erg,. We shall 
henceforth consider the capacity and MI as a function of p. 

The exact solution for the power constrained optimization 
problem (doll with the channel model (H is unknown. To 
obtain a lower bound on the capacity, we shall assume the 
input symbols X are drawn from a trial input distribution. In 
this work we use the Rayleigh PDE 


, , 2a; I X \ 

px(a;) = —exp - 21 (13) 

which leads to exponentially-distributed soliton amplitudes Aq 
with mean erg. 







The next two Lemmas provide exact closed-form expres¬ 
sions for the output differential entropy hy of symbols Y 
with input symbols X distributed according to (fOl) and for 
the conditional differential entropy hy^x- 

Lemma 1: For the channel in @ and the input distribution 

(HI 

hy = log - log \/l + p-^ - P~^ log y/TTp 

+ p + '0(p”^)-1, (14) 

where ip{x) = ^lnr(a;) is the digamma function, and r(a;) 
is the gamma function. 

Lemma 2: For the channel in (0 and the input distribution 

(HI 

hy\x = log \f^ -f 2 (1 -f p) - (1 -f p~^) log (1 -f p) 

- p~^^Jl + p-^ F{p) - -log2, (15) 

where 

OO 

F{p)^ y'eiTi(x/TT^ 0 ^i(C)log[/i( 0 ]de, (16) 
0 

and Ki (x) is the modified Bessel function of the second kind 
of order one. 

Sketch of the proof: To prove both lemmas, the output 
distribution py{y) = pY\x{y\x)px{x)dx is calculated 
using (0 and (fTJt . The derived output PDF py{y) is then used 
in the definitions of differential entropies. The results of both 
Lemmas are then obtained by evaluating the corresponding 
integrals. The calculation follows closely that from the earlier 
work HU, where calculations were performed for a chi- 
squared distribution with two degrees of freedom (cf. (fThl l and 
El eq. (24)]). ■ 

We note that the proof of Lemma 2 includes finding a 
closed-form expression for the differential entropy of a chi- 
squared distribution with four degrees of freedom. To the best 
of our knowledge, this has never been previously reported 
in the literature^ The results from Lemmas 1 and 2 can be 
combined to produce the following theorem. 

Theorem 1: For the channel (|9]l and the input distribution 

HI 

IxY =log (p\/l-f p-l) + p~^\og{yJlF p) - P 

+ p-Vl + p-^ F{p) + ^(p-1) - V-(l) - 1. (17) 

Proof: From Lemmas 1 and 2 and (fTTT i. ■ 

The results of Lemma 1, Lemma 2, and Theorem 1 are illus¬ 
trated in Fig.m Analytical curves for the functions hy, hy^Xi 
and IxY are compared with results obtained via numerical 
integration. 

^However, a closed-form expression for the expected-log of a noncentral 
chi-squared distribution with even number of degrees of freedom was given 
in El Lemma 10.1]. 



p [dB] 

Fig. 2. The differential entropy of output disti'ibution hy in (14), the 
differential conditional entropy hy^x (H). and the MI IxY in di). 
Results obtained via numerical integration are also shown (circles). 



Fig. 3. The ratio between the MI IxY in (13 and the function /as in (H). 

The next theorem shows that the capacity lower bound is 
asymptotically equivalent to half the logarithm of the SNR, 
which is the main result of our work. 

Theorem 2: The MI Ixy in ( fTTI ) satisfies 

lim = 1 , (18) 

p-s-oo Ixy 

where 

4s = ^logp. (19) 

Proof: The proof follows from an asymptotic expansion 
of Ixy in (fTTl i together with the asymptotic expansion of (fThl) 
provided in IflTl . ■ 

Fig. m shows the numerical evaluation of the ratio 
Ins,/I xy and confirms that the MI behaves asymptotically 
as (l/2)logp, or equivalently, as (l/2)logSNR. According 
to Fig. [3 the asymptotic function ( fT9b approaches the MI 
from above. Interestingly, the expression ( fT9b has appeared in 
asymptotic analyses of optical systems (see e.g., nil eq. (25)1, 
M Sec. V-A], ll45l eq. ( 6 )]). Since the channel capacity is 
lower-bounded by Ixy 7 this result implies that the capacity 
grows at least as fast as (1/2) log SNR, when SNR —>■ oo. 

IV. Conclusions 

By using a rigorous channel model based on the exact 
conditional PDF for the soliton amplitudes in (|7]i, an exact 
closed-form expression for a lower bound on the capacity of 























the nonlinear optical fibre channel with no inline dispersion 
compensation was derived. It has been analytically demon¬ 
strated that the lower bound on the capacity for the channel 
based on the individual amplitudes of well separated solitons 
displays an unbounded growth similarly to the linear Gaussian 
channel. 
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